Corpus: fra_news_2007_30K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 95 97 99 99 99
1000 870 973 992 999 999
10000 6554 8907 9759 9947 9984
100000 14980 24096 28413 29620 29861
1000000 14980 24096 28413 29620 29861


Zipf's diagram for sentence endings


Gnuplot diagram

2038 msec needed at 2018-03-02 23:11